Procedural Model


Neurally-Guided Procedural Models: Amortized Inference for Procedural Graphics Programs using Neural Networks

Ritchie, Daniel, Thomas, Anna, Hanrahan, Pat, Goodman, Noah D.

Neural Information Processing Systems

Probabilistic inference algorithms such as Sequential Monte Carlo (SMC) provide powerful tools for constraining procedural models in computer graphics, but they require many samples to produce desirable results. In this paper, we show how to create procedural models which learn how to satisfy constraints. We augment procedural models with neural networks which control how the model makes random choices based on the output it has generated thus far. We call such models neurally-guided procedural models. As a pre-computation, we train these models to maximize the likelihood of example outputs generated via SMC. They are then used as efficient SMC importance samplers, generating high-quality results with very few samples. We evaluate our method on L-system-like models with image-based constraints. Given a desired quality threshold, neurally-guided models can generate satisfactory results up to 10x faster than unguided models.
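The key mechanism here, running SMC with a learned network as the importance proposal, can be illustrated with a toy 1-D random walk constrained to end near a goal. This is a minimal sketch, not the authors' implementation: `guide_dist` is a hypothetical stand-in for the trained guide network, and the soft constraint score is purely illustrative.

```python
import math
import random

def guide_dist(pos, goal):
    # Hypothetical stand-in for the trained guide network: a step
    # distribution biased toward satisfying the constraint
    # (ending the walk at `goal`).
    p_up = 0.8 if goal > pos else 0.2
    return {1: p_up, -1: 1.0 - p_up}

def guided_smc(n_particles, n_steps, goal):
    """SMC over a 1-D random walk whose prior picks +1/-1 uniformly.
    Each choice is sampled from the guide; the importance weight
    (prior / proposal) times a soft constraint score corrects for
    the guide's bias before resampling."""
    particles = [0] * n_particles
    for _ in range(n_steps):
        weights = []
        for i, pos in enumerate(particles):
            q = guide_dist(pos, goal)
            dx = random.choices(list(q), weights=list(q.values()))[0]
            particles[i] = pos + dx
            # prior(dx) = 0.5; soft likelihood rewards being near the goal
            w = (0.5 / q[dx]) * math.exp(-abs(particles[i] - goal) / n_steps)
            weights.append(w)
        # resample particles in proportion to their weights
        particles = random.choices(particles, weights=weights, k=n_particles)
    return particles
```

With a good guide, most particles already satisfy the constraint, which is why far fewer samples are needed than with an unguided prior proposal.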



Separating Knowledge and Perception with Procedural Data

Rodríguez-Muñoz, Adrián, Baradad, Manel, Isola, Phillip, Torralba, Antonio

arXiv.org Artificial Intelligence

We train representation models with procedural data only, and apply them on visual similarity, classification, and semantic segmentation tasks without further training by using visual memory -- an explicit database of reference image embeddings. Unlike prior work on visual memory, our approach achieves full compartmentalization with respect to all real-world images while retaining strong performance. Compared to a model trained on Places, our procedural model performs within $1\%$ on NIGHTS visual similarity, outperforms by $8\%$ and $15\%$ on CUB200 and Flowers102 fine-grained classification, and is within $10\%$ on ImageNet-1K classification. It also demonstrates strong zero-shot segmentation, achieving an $R^2$ on COCO within $10\%$ of the models trained on real data. Finally, we analyze procedural versus real data models, showing that parts of the same object have dissimilar representations in procedural models, resulting in incorrect searches in memory and explaining the remaining performance gap.
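The visual-memory mechanism, an explicit database of reference embeddings queried by similarity, can be sketched as a nearest-neighbor classifier. This is a minimal illustration under assumed details (cosine similarity, majority vote over the top-k matches); the `VisualMemory` class and its methods are hypothetical, not the paper's API.

```python
import numpy as np

class VisualMemory:
    """Explicit visual memory: unit-norm reference embeddings with
    labels, queried by cosine similarity. The embedder that produces
    the vectors (here supplied by the caller) would be the frozen
    model trained on procedural data."""

    def __init__(self):
        self.embeddings = []  # list of unit-norm vectors
        self.labels = []

    def add(self, emb, label):
        v = np.asarray(emb, dtype=float)
        self.embeddings.append(v / np.linalg.norm(v))
        self.labels.append(label)

    def classify(self, emb, k=3):
        v = np.asarray(emb, dtype=float)
        v = v / np.linalg.norm(v)
        sims = np.stack(self.embeddings) @ v          # cosine similarities
        top = np.argsort(sims)[::-1][:k]              # k nearest references
        votes = [self.labels[i] for i in top]
        return max(set(votes), key=votes.count)       # majority vote
```

Because all real-world knowledge lives in the database rather than the weights, swapping or deleting reference images changes behavior without any retraining, which is the compartmentalization property the paper emphasizes.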


Synthesizing 3D Abstractions by Inverting Procedural Buildings with Transformers

Dax, Maximilian, Berbel, Jordi, Stria, Jan, Guibas, Leonidas, Bergmann, Urs

arXiv.org Artificial Intelligence

We generate abstractions of buildings, reflecting the essential aspects of their geometry and structure, by learning to invert procedural models. We first build a dataset of abstract procedural building models paired with simulated point clouds and then learn the inverse mapping through a transformer. Given a point cloud, the trained transformer then infers the corresponding abstracted building in terms of a programmatic language description. Training is fully supervised, based on a dataset of procedural buildings paired with corresponding point cloud simulations. We develop various technical components tailored to the generation of abstractions. This includes the design of a programmatic language to efficiently represent abstractions and its combination with a technique to guarantee transformer outputs consistent with the structure imposed by this language.
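The guarantee that transformer outputs stay consistent with the programmatic language can be approximated by grammar-constrained decoding: at each step, tokens the grammar forbids are masked out before one is chosen, so every output is syntactically valid by construction. The sketch below is hypothetical; the toy vocabulary, grammar, and `toy_score` are stand-ins for the paper's language and transformer logits.

```python
def allowed_next(prefix):
    """Toy grammar for a flat 'building program': a sequence of
    floor(<height>) calls, then <end>."""
    if not prefix or prefix[-1] == ")":
        return ["floor(", "<end>"]
    if prefix[-1] == "floor(":
        return ["1", "2"]
    return [")"]  # after a height token, the call must close

def toy_score(prefix, tok):
    # Hypothetical stand-in for transformer logits: prefers to emit
    # exactly two floors of height 1, then stop.
    n_floors = prefix.count("floor(")
    if tok == "floor(":
        return 1.0 if n_floors < 2 else 0.0
    if tok == "<end>":
        return 0.5
    return {"1": 0.7, "2": 0.6}.get(tok, 0.1)

def constrained_decode(score, max_len=12):
    """Greedy decoding restricted to grammar-legal tokens: the
    argmax is taken only over `allowed_next`, i.e. illegal tokens
    are masked before selection."""
    out = []
    for _ in range(max_len):
        tok = max(allowed_next(out), key=lambda t: score(out, t))
        if tok == "<end>":
            break
        out.append(tok)
    return out
```

Even if the scoring model assigns high probability to an illegal token, the mask prevents it from ever being emitted, which is the essence of guaranteeing structure at decode time.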


Generating Diverse Agricultural Data for Vision-Based Farming Applications

Cieslak, Mikolaj, Govindarajan, Umabharathi, Garcia, Alejandro, Chandrashekar, Anuradha, Hädrich, Torsten, Mendoza-Drosik, Aleksander, Michels, Dominik L., Pirk, Sören, Fu, Chia-Chun, Pałubicki, Wojciech

arXiv.org Artificial Intelligence

We present a specialized procedural model for generating synthetic agricultural scenes, focusing on soybean crops, along with various weeds. This model is capable of simulating distinct growth stages of these plants, diverse soil conditions, and randomized field arrangements under varying lighting conditions. The integration of real-world textures and environmental factors into the procedural generation process enhances the photorealism and applicability of the synthetic data. Our dataset includes 12,000 images with semantic labels, offering a comprehensive resource for computer vision tasks in precision agriculture, such as semantic segmentation for autonomous weed control. We validate our model's effectiveness by comparing the synthetic data against real agricultural images, demonstrating its potential to significantly augment training data for machine learning models in agriculture. This approach not only provides a cost-effective solution for generating high-quality, diverse data but also addresses specific needs in agricultural vision tasks that are not fully covered by general-purpose models.



DeepTree: Modeling Trees with Situated Latents

Zhou, Xiaochen, Li, Bosheng, Benes, Bedrich, Fei, Songlin, Pirk, Sören

arXiv.org Artificial Intelligence

In this paper, we propose DeepTree, a novel method for modeling trees based on learning developmental rules for branching structures instead of manually defining them. We call our deep neural model situated latent because its behavior is determined by the intrinsic state -- encoded as a latent space of a deep neural model -- and by the extrinsic (environmental) data that is situated as the location in the 3D space and on the tree structure. We use a neural network pipeline to train a situated latent space that allows us to locally predict branch growth only based on a single node in the branch graph of a tree model. We use this representation to progressively develop new branch nodes, thereby mimicking the growth process of trees. Starting from a root node, a tree is generated by iteratively querying the neural network on the newly added nodes resulting in the branching structure of the whole tree. Our method enables generating a wide variety of tree shapes without the need to define intricate parameters that control their growth and behavior. Furthermore, we show that the situated latents can also be used to encode the environmental response of tree models, e.g., when trees grow next to obstacles. We validate the effectiveness of our method by measuring the similarity between our tree models and procedurally generated ones using a number of established metrics for tree form.
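The growth loop described here, iteratively querying a model on newly added nodes, can be sketched as follows. `toy_predictor` is a hypothetical stand-in for the situated-latent network, which in the paper predicts branch geometry as well as branching decisions.

```python
def grow_tree(predict_children, max_depth):
    """DeepTree-style generation loop: starting from a root node,
    repeatedly query a per-node model on each newly added node to
    decide how many child branches it sprouts."""
    root = {"depth": 0, "children": []}
    frontier = [root]
    while frontier:
        node = frontier.pop()
        if node["depth"] >= max_depth:
            continue
        for _ in range(predict_children(node)):
            child = {"depth": node["depth"] + 1, "children": []}
            node["children"].append(child)
            frontier.append(child)  # newly added nodes are queried next
    return root

def toy_predictor(node):
    # Hypothetical stand-in for the situated-latent network:
    # branch twice near the trunk, stop deeper in the crown.
    return 2 if node["depth"] < 2 else 0
```

Because each query only sees a single node (its "situation"), environmental responses such as obstacle avoidance can in principle be folded into the predictor without changing this outer loop.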


Neural Synthesis of Footsteps Sound Effects with Generative Adversarial Networks

Comunità, Marco, Phan, Huy, Reiss, Joshua D.

arXiv.org Artificial Intelligence

Footsteps are among the most ubiquitous sound effects in multimedia applications. There is substantial research into understanding the acoustic features and developing synthesis models for footstep sound effects. In this paper, we present a first attempt at adopting neural synthesis for this task. We implemented two GAN-based architectures and compared the results with real recordings as well as six traditional sound synthesis methods. Our architectures reached realism scores as high as recorded samples, showing encouraging results. To this day, there has not yet been an attempt at exploring the use of neural networks for the synthesis of footstep sounds, although there is substantial literature exploring neural synthesis of broadband impulsive sounds, such as drum samples, which have some similarities to footsteps. One of the first attempts was in [15], where Donahue et al. developed WaveGAN, a generative adversarial network for unconditional audio synthesis. Another example of neural synthesis of drums is [16], where the authors used a Progressive Growing GAN. Variational autoencoders [17] and U-Nets [18] have also been used for the same task.

